Designing an active learning based system for corpus annotation

نویسندگان

  • Bertjan Busser
  • Roser Morante
چکیده

In this paper we review some Active Learning experimental results in order to set up the basis for designing an active learning based system for corpus annotation. Based on the experimental data we design a modular system that allows for initially learning fast, but that it is capable of switching to a slower and more precise learning strategy. The system is designed to perform a semantic role labelling task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Design Process in Student and Instructor

In this paper the designing products of B.A. Sophomore students of architecture in TehranUniversity who were divided into two kinds of learning namely technical and skill-based learning. In technical learningthe subjective steps of creativity process i.e. "insight", "preparation", "incubation", "intuition", and "verification"were discussed and it was suggested that these steps cannot be taught ...

متن کامل

Modeling the Annotation Process for Ancient Corpus Creation

In corpus creation human annotation is expensive. Annotation costs can be minimized through machine learning and active learning, however there are many complex interactions among the machine learner, the active learning technique, the annotation cost, human annotation accuracy, the annotator user interface, and several other elements of the process. For example, we show that changing the way i...

متن کامل

Active Learning Based Corpus Annotation

Opinion Mining aims to automatically acquire useful opinioned information and knowledge in subjective texts. Research of Chinese Opinioned Mining requires the support of annotated corpus for Chinese opinioned-subjective texts. To facilitate the work of corpus annotators, this paper implements an active learning based annotation tool for Chinese opinioned elements which can identify topic, senti...

متن کامل

Video Corpus Annotation Using Active Learning

Concept indexing in multimedia libraries is very useful for users searching and browsing but it is a very challenging research problem as well. Beyond the systems’ implementations issues, semantic indexing is strongly dependent upon the size and quality of the training examples. In this paper, we describe the collaborative annotation system used to annotate the High Level Features (HLF) in the ...

متن کامل

Building a comprehensive syntactic and semantic corpus of Chinese clinical texts

OBJECTIVE To build a comprehensive corpus covering syntactic and semantic annotations of Chinese clinical texts with corresponding annotation guidelines and methods as well as to develop tools trained on the annotated corpus, which supplies baselines for research on Chinese texts in the clinical domain. MATERIALS AND METHODS An iterative annotation method was proposed to train annotators and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Procesamiento del Lenguaje Natural

دوره 35  شماره 

صفحات  -

تاریخ انتشار 2005